Passive Classification of Source Printer using Text-line-level Geometric Distortion Signatures from Scanned Images of Printed Documents

نویسندگان

  • Hardik Jain
  • Gaurav Gupta
  • Sharad Joshi
  • Nitin Khanna
چکیده

In this digital era, one thing that still holds the convention is a printed archive. Printed documents find their use in many critical domains such as contract papers, legal tenders and proof of identity documents. As more advanced printing, scanning and image editing techniques are becoming available, forgeries on these legal tenders pose a serious threat. Ability to easily and reliably identify source printer of a printed document can help a lot in reducing this menace. During printing procedure, printer hardware introduces certain distortions in printed characters’ locations and shapes which are invisible to naked eyes. These distortions are referred as geometric distortions, their profile (or signature) is generally unique for each printer and can be used for printer classification purpose. This paper proposes a set of features for characterizing text-line-level geometric distortions, referred as geometric distortion signatures and presents a novel system to use them for identification of the origin of a printed document. Detailed experiments performed on a set of thirteen printers demonstrate that the proposed system achieves state of the art performance and gives much higher accuracy under small training size constraint. For four training and six test pages of three different fonts, the proposed method gives 99% classification accuracy.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

رفع اعوجاج هندسی متون به‌کمک اطلاعات هندسی خطوط متن

Document images produced by scanners or digital cameras usually have photometric and geometric distortions. If either of these effects distorts document, recognition of words from such a document image using OCR is subject to errors. In this paper we propose a novel approach to significantly remove geometric distortion from document images. In this method first we extract document lines from do...

متن کامل

Fabrication of New 3D Phantom for the measurement of Geometric Distortion in Magnetic Resonance Imaging System

Introduction: Geometric distortion, an important parameter in neurology and oncology. The current study aimed to design and construct a new three-dimensional (3D) phantom using a 3D printer in order to measure geometric distortion and its 3D reproducibility. Material and Methods: In this study, a new phantom ...

متن کامل

Fabrication of New 3D Phantom for Measuring Geometric Distortion in Magnetic Resonance Imaging System

  Introduction: Geometric distortion is a major shortcoming of magnetic resonance imaging (MRI), which has an important influence on the accuracy of volumetric measurements, an important parameter in neurology and oncology. Our goal is to design and construct a new three- dimensional phantom using a 3D printer in order to measure geometric distortion and its reproducibility in...

متن کامل

Local Binary Patterns for Printer Identification based on Texture Analysis

This paper proposes a texture analysis of the printed document based on Local Binary Pattern (LBP) descriptor for the application of printer identification. The LBP provides a statistical description of the pixels’ gray level differences within their neighborhoods. The occurrence histogram of local binary patterns is able to capture the document’s texture modifications by the distortion during ...

متن کامل

Script Identification of Text Words from a Tri Lingual Document Using Voting Technique

In a multi script environment, majority of the documents may contain text information printed in more than one script/language forms. For automatic processing of such documents through Optical Character Recognition (OCR), it is necessary to identify different script regions of the document. In this context, this paper proposes to develop a model to identify and separate text words of Kannada, H...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:
  • CoRR

دوره abs/1706.06651  شماره 

صفحات  -

تاریخ انتشار 2017